Back

JAMA Pediatrics

American Medical Association (AMA)

Preprints posted in the last 7 days, ranked by how well they match JAMA Pediatrics's content profile, based on 10 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.

1
Characterizing Documented Psychosocial Stressors in Pediatric Psychiatric Emergencies with an Open-Weight Large Language Model

Hartlage, C. S.; Manning, E. R.; Bernard, J.; Vaish, S.; Gray, J.; Young, M.; Pestian, T.; Folger, A. T.; Tachinardi, P.; Mendonca, E. A.; Brokamp, C.

2026-06-09 health informatics 10.64898/2026.06.08.26354931 medRxiv
Top 0.1%
4.9%
Show abstract

Objective: To evaluate whether a locally hosted open-weight large language model (LLM) can extract documented psychosocial factors from pediatric psychiatric intake notes and apply validated extraction to a large emergency psychiatry cohort. Materials and Methods: We identified emergency department presentations at Cincinnati Children's Hospital Medical Center from January 1, 2016, through December 31, 2024, among patients younger than 18 years with psychiatric billing diagnoses. Using full-text intake notes, gpt-oss:120b classified peer conflict, sleep disruption, and school-related academic, attendance, and disciplinary issues as detected, negated, or indeterminate. Four human raters independently reviewed 50 notes. We compared Fleiss' kappa among humans alone versus humans plus the LLM, assessed repeated-query stability across 50 independent calls per note, and applied the workflow to all eligible notes. Results: Among 37,315 eligible admissions, 22,284 had eligible intake notes; 22,270 produced parseable JSON. In detected-versus-not-detected coding, human-plus-LLM reliability did not differ significantly from human-only reliability across measures (human {kappa} 0.71-0.94; human-plus-LLM {kappa} 0.70-0.93). Stability was associated with human agreement: mean LLM-human agreement increased from 42.6% for classifications with less than 80% stability to 82.7% for classifications with 100% stability (Pearson r = 0.36). Full-cohort extraction showed frequent and overlapping documented factors: sleep disruption was most frequently detected (57.7%), followed by peer conflict (47.2%), academic issues (43.4%), disciplinary issues (43.3%), and attendance issues (16.9%). Discussion: Agreement varied by construct and was strongest when repeated model outputs were stable. Conclusion: Locally hosted open-weight LLMs can support scalable structured extraction of documented psychosocial factors from pediatric psychiatric intake notes after local validation.

2
Neonatal mortality risk of large-for-gestational age and macrosomic live births in low- and middle-income subnational birth cohorts: An individual participant meta-analysis (2000-2017)

Kirakoya Samadoulougou, F.; Barche, B.; Ukwishaka, J.; Subedi, S.; Erchick, D. J.; Suarez Idueta, L.; Hamer, D. H.; Semrau, K. E. A.; Hamomba, F. M.; Banda, B.; Manasyan, A.; Pry, J. M.; Maleta, K.; Ashorn, U.; Schmiegelow, C.; Hjort, L.; Minja, D. T. R.; Lusingu, J. P. A.; Freitas da Silveira, M.; Buffarini, R.; Baqui, A. H.; Khanam, R.; Ahmed, S.; Zhu, Z.; Zeng, L.; Cheng, Y.; Lachat, C.; Roberfroid, D.; Huybregts, L.; Toe, L. C.; Tielsch, J. M.; Khatry, S. K.; Mullany, L. C.; Ohuma, E. O.; Blencowe, H.; Katz, J.; Lee, A. C. C.; Black, R. E.; Hazel, E. A.

2026-06-06 public and global health 10.64898/2026.06.03.26354851 medRxiv
Top 0.1%
2.0%
Show abstract

Background Large-for-gestational-age (LGA) and macrosomic newborns are at increased risk of adverse perinatal outcomes, including death, yet the burden of neonatal mortality associated with these conditions in low- and middle-income countries (LMICs), where ongoing nutritional and epidemiological transitions suggest their prevalence will rise, remains poorly quantified. In this study, we quantify the neonatal mortality risk associated with LGA and macrosomia from 16 subnational birth cohorts in low- and middle-income countries between 2000 and 2017. Methods and findings This is an individual-participant meta-analysis to estimate neonatal mortality rates (NMRs) and relative risks among LGA infants (>90th and >97th percentile birth weight-for-gestational-age using INTERGROWTH-21st) versus appropriate-for-gestational-age (AGA, 10th-90th percentile) infants. Macrosomic ([≥]4000 g and [≥]4500 g) neonates were compared with those weighing 2500 g-3999g. Missing birth weights were imputed using recalibration and multiple imputation methods. We used random effects meta-analysis to pool relative risks. Median prevalences of LGA >90th and >97th percentile were 5.3% (interquartile range 3.6-8.2) and 2.6% (IQR 1.3-4.5), respectively; macrosomia ([≥]4000 g and [≥]4500 g) prevalences were 1.0% (IQR 0.3-3.1) and 0.06% (IQR 0.0, 0.30), respectively. Mortality was highest among preterm plus LGA infants (61.3 per 1000). LGA infants in the >90th percentile had over twofold increased mortality compared with appropriate-for-gestational-age infants (RR: 2.46; 95% CI: 1.86-3.25), while >97th percentile infants had a higher risk (RR: 3.77; 95% CI: 2.50-5.69). Term LGA >97th percentile infants also showed elevated mortality (RR: 3.14; 95% CI: 1.58-6.22). For LGA >97th percentile, the risk was higher in the early neonatal period (RR: 2.71; 95% CI: 1.92-3.82) than late (RR: 1.69; 95% CI: 1.22-2.34). There was no overall association between macrosomia ([≥]4000 g) and neonatal mortality. Population attributable fractions were 7.2% for LGA >90th percentile and 0.4% for macrosomia ([≥]4000 g). Conclusions Neonatal mortality risks were elevated among LGA infants in low- and middle-income countries, particularly at extreme values (>97th percentile) and during the early neonatal period. Macrosomia showed weaker, less robust associations. Although LGA prevalence is currently low ([~]5%) and contributes less to neonatal mortality than small newborns, ongoing nutritional and epidemiological transitions suggest increasing prevalence. This highlights the need for strengthened surveillance, monitoring, and improved delivery planning to ensure that no population is left behind.

3
Who Supports the Caregivers? Perspectives on Mental Health Screening in Paediatrics.

Coscini, N.; Giallo, R.; Grobler, A.; Hiscock, H.; Mulraney, M.; Pope, N.

2026-06-08 psychiatry and clinical psychology 10.64898/2026.06.04.26354967 medRxiv
Top 0.1%
1.5%
Show abstract

Objectives To explore caregiver and clinicians perspectives on implementing mental health conversations and supports for caregivers of children with chronic conditions in paediatric outpatient clinics. Specifically, views were sought on (a) screening approaches and measures (phase 1) and (b) how feedback and support could be provided to caregivers experiencing mental health difficulties (phase 2). Methods Caregivers and clinicians from two outpatient clinics (neuromuscular and diabetes) at a tertiary paediatric hospital in Melbourne, Australia participated in online focus groups in July and August 2024. Caregivers were recruited from outpatient clinics and clinicians were recruited via email. Both groups were combined for phase 1 before separating into breakout rooms for phase 2. Two authors conducted reflexive thematic analysis of transcripts using NVivo. Results Sixteen participants (caregivers n = 8; and clinicians n = 8) took part in in two semi-structured focus groups. Analysis generated two overarching domains, each comprising multiple themes. Domain 1, Addressing caregiver mental health, captured themes of overwhelm and invisibility, diverse caregiving roles, and the need for time and resources to support wellbeing conversations. Domain 2, Housing the mental health conversation, encompassed themes of screening preferences, caregiver agency in confidentiality, delivery of feedback, and access to tailored supports. Conclusions Caregivers and clinicians support routine caregiver mental health discussions in paediatric outpatient settings. Caregivers favour screening at diagnosis and key transitions, with clear, and actionable feedback delivered away from the child. Questions about record-keeping warrant further exploration, as do the perspectives of fathers.

4
WELL-ED: Wellbeing and Education linkages in school-aged children - A protocol for a population-based register study and survey of adolescents

Kosola, S.; Salonen, S.; Miettinen, J.; Horhammer, I.; Impio, A.-R.; Kumpulainen, S. M.; Sergejeff, J.; Numari, S.; Laitinen-Parkkonen, P.; Tapola-Haapala, M.; Aaltio, E.; Thorn, L.

2026-06-08 public and global health 10.64898/2026.06.06.26355053 medRxiv
Top 0.3%
0.5%
Show abstract

Introduction Education is a core social determinant of health for children and adolescents. Unfortunately, academic achievement, health, and wellbeing of adolescents have decreased in many developed countries in the past decade. The purpose of the Wellbeing and Education linkages in school-aged children (WELL-ED) study is to examine associations of school absences and academic achievement with use of school-based and community-based health and social welfare services. In addition, we will assess user experiences and multi-sector services pathways of school-aged children for a better understanding of how the service system could respond to the needs of children. Methods and analysis WELL-ED is a large population-based study that combines register data on school absences and educational support from municipalities with register data on healthcare and social service use collected from wellbeing services counties in Finland. The study cohort includes all children who attended mandatory education in public schools in Southern Finland in school year 2023-2024. A smaller cohort of adolescents in school year 8 was invited to complete a user experience survey. The primary outcomes of this study are related to equity of service use. Ethics and dissemination The Regional Committee on Medical Research Ethics of the Helsinki and Uusimaa Hospital District (2803/2024) has approved the WELL-ED study protocol. For the survey, adolescents in year 8 and parents of adolescents younger than 15 provided informed consent. Results will be published in peer-reviewed journals, summaries will be sent to participating municipalities and wellbeing services counties and press releases will be written on key findings.

5
Identifying Clinical Diagnostic Trajectories Associated With Suicide Death Using Temporal Sequence Mining of Linked Claims and Mortality Data

Belouali, A.; Kitchen, C.; Haroz, E.; Lehmann, H.; Nestadt, P. S.; Wilcox, H. C.; Kharrazi, H.

2026-06-10 health informatics 10.64898/2026.06.08.26355231 medRxiv
Top 0.4%
0.3%
Show abstract

Background: Most approaches to suicide risk assessment consider clinical conditions as independent risk factors, potentially overlooking prognostic information in the order in which conditions accumulate. We applied temporal sequence mining to linked claims and mortality data to identify ordered clinical diagnostic trajectories associated with suicide death. Results: The cohort included 3 647 059 insured Maryland residents aged 10 years or older with available claims records in the Maryland Suicide Data Warehouse from January 1, 2016, to December 31, 2020, among whom 768 suicide deaths were ascertained through medical examiner linkage. Sequential pattern mining of ICD-10-CM diagnoses grouped into Clinical Classifications Software Refined categories identified 89 221 candidate sequences, of which 1 816 remained significantly associated with suicide death in time-varying Cox models. Adjusted hazard ratios (AHRs) ranged from 2.4 to 134.1. Two-thirds of significant trajectories ended in physical conditions, and approximately half crossed from psychiatric to physical endpoints. Among suicide decedents, 62% were exposed to at least 1 significant sequence (median, 16 per case); median sequence duration was 18.7 months, and median time from completion to death was 13.1 months. In landmark analyses, among patients with depression who later developed suicidal ideation (n = 26 356), the path through anxiety, then anemia, was associated with higher risk (AHR, 4.6; 95% CI, 2.2-9.5), whereas the anxiety-only path was not (AHR, 1.3; 95% CI, 0.8-2.1). Among patients with anxiety who later developed hypertension (n = 149 215), the path through history of self-harm was associated with higher risk (AHR, 32.0; 95% CI, 16.6-61.6). Associations were generally consistent across sex and age. Conclusions: Temporal ordering of clinical conditions may carry prognostic information for suicide death. Clinical trajectories incorporating physical illness within psychiatric sequences identified higher-risk groups. These findings suggest that opportunities for risk detection may extend beyond psychiatric settings and that suicide risk signals may be fragmented across care settings and not apparent within isolated encounters.

6
Characterizing artificial intelligence (AI) psychosis in a large academic medical setting: evidence of the new clinical phenomenon and the vulnerability of those in early phases of psychosis

Bergson, Z.; Vassall, S. G.; Wright, A.; McCoy, A. B.; Schafer, K. M.; Achee, M. C.; Sheffield, J. M.

2026-06-08 public and global health 10.64898/2026.06.04.26354939 medRxiv
Top 0.5%
0.3%
Show abstract

Background: Concerns about "AI psychosis" have swirled in the media since ChatGPT's release, but few systematic analyses exist. We therefore conducted an electronic health record (EHR) analysis to identify the frequency, clinical characteristics, and quality of AI interactions in patients experiencing psychosis treated in a medical center. Methods: AI keywords (e.g., ChatGPT, AI) were used to search Vanderbilt University Medical Center's EHR from 12/1/2022-4/1/2026. Records were discarded if they were not AI-related or if the primary diagnosis did not include psychosis. Three raters read notes to determine if a patient was experiencing AI psychosis and classified the interactions using 4 a-priori categories (Catalyst, Amplifier, Co-Author, Object) formulated to explain how AI-related negative outcomes emerge. Findings: 73 patients met our criteria. 28 patients were rated as experiencing AI psychosis, 17 had neutral interactions, and 28 expressed delusional content related to AI without documented evidence of conversational AI use. ChatGPT was the matching keyword for 53.6% patients experiencing AI psychosis. The majority of AI psychosis cases were documented after ChatGPT's "4o" model was released in May 2024. Notably, the AI Psychosis group had significantly more patients experiencing a first psychotic episode (60.7%) compared to the other two groups. Amplifier was the most common (64.3%) qualitative rating in the AI Psychosis group. Interpretation: "AI psychosis" is an infrequent but real phenomenon observed in clinical practice. Most affected patients were experiencing their first psychotic episode and presented with AI psychosis following the release of the more sycophantic GPT-4o. Among the affected patients, AI most often exacerbated an existing condition by reinforcing distorted ideas.

7
Long-term Penetrance of Disease Variants in Genes Prioritized for Genomic Newborn Screening: Evidence from Adult Biobanks

Gold, N. B.; Zouk, H.; Yeo, J.; Lipsitz, S.; Koyama, S.; Somanchi, H.; Perez, E.; Selvaraj, M. S.; O'Grady, L.; Miller, E.; Lewis, A. C. F.; Karlson, E. W.; Strong, A.; Gold, J. I.; Rehm, H. L.; Natarajan, P.; Green, R. C.

2026-06-11 genetic and genomic medicine 10.64898/2026.06.10.26355380 medRxiv
Top 0.5%
0.2%
Show abstract

Importance: Genomic newborn screening (gNBS) is a potential public health intervention, but its positive predictive value (PPV) remains uncertain. Estimating the prevalence and penetrance of pathogenic and likely pathogenic (P/LP) variants in genes prioritized for screening may clarify the long-term PPV and clinical utility of gNBS. Objective: To compare ICD-based ascertainment, electronic medical record (EMR) review, and clinical assessment of genetic disorders in adults with P/LP variants in 54 genes prioritized for gNBS. Design: Two-cohort observational study with EMR review and clinical assessment in the hospital-based cohort. Setting: The U.K. Biobank (UKB) and Mass General Brigham Biobank (MGBB). Participants: 451,877 adults from the UKB and 53,371 from the MGBB, all with exome sequencing data. Exposures: P/LP variants in 54 genes prioritized through expert consensus for gNBS, in genotypes consistent with each gene's inheritance pattern. Main outcomes and measures: The primary outcome was the absolute difference in the proportion of MGBB participants identified as affected by ICD versus EMR ascertainment. Secondary outcomes included findings from clinical assessments of undiagnosed MGBB participants, corrected UKB penetrance estimates, and extrapolation to U.S.. annual birth cohorts and living adults. Results: P/LP variants were identified in 665 UKB participants (0.15%) and 82 MGBB participants (0.15%), approximately 1 in 650. In MGBB, EMR review revealed that 58/82 individuals (70.7%) were undiagnosed, although 25 of 58 (43.1%) had documented symptoms. Disease-associated ICD codes were found in 39.0% (32/82) of participants, whereas EMR review identified symptoms in 59.8% (49/82, McNemar P<.001). Applied to UKB, this correction yielded a penetrance of 28.4% (95% CI, 18.6% to 38.2%), implying that 73 to 203 participants beyond the 51 identified by ICD codes may have clinical features of disease. Extrapolated to U.S. birth cohorts, 4,900 to 5,700 newborns per year may harbor P/LP variants in these genes and survive into adulthood. Approximately 355,000 to 410,000 U.S. adults may have P/LP variants in these genes. Conclusions and relevance: Penetrance of P/LP variants in genes prioritized for gNBS is substantially higher than ICD estimates suggest. Many adults with P/LP variants are symptomatic but undiagnosed, supporting inclusion of these genes in gNBS.

8
The developmental trajectory of EEG alpha coherence in autistic toddlers with and without language delay

Mandl, S.; Chung, H.; An, W. W.; Thomas, R. P.; Bose, A.; Faja, S.; Wilkinson, C. L.

2026-06-09 pediatrics 10.64898/2026.06.03.26354124 medRxiv
Top 0.6%
0.2%
Show abstract

Although language acquisition delays are frequently observed in children with autism spectrum disorder (autism), our current understanding of the neurobiological mechanisms underlying language development in autism is sparse. Previous studies have found resting-state electroencephalography (EEG) power to be associated with language abilities in autistic children. However, longitudinal studies examining resting-state EEG phase coherence in relation to language development in preschool-aged children with autism are limited. This study aimed to characterize age- and group-related changes in whole-brain coherence in neurotypical children and in autistic children with and without language delay. Resting-state EEG and language data were collected at 2, 3, and 4 years of age. Peak phase coherence within the alpha band (6-11 Hz) was calculated at each timepoint and differences in the developmental trajectory of peak alpha coherence (PAC) were analyzed. In neurotypical children, PAC increased between 2 and 4 years of age. In contrast, PAC did not significantly change with age in children with autism. However, when examining autistic children based on language delay status, PAC increased with age in autistic children without language delay, but not in children with language delay. Exploratory analysis revealed evidence for an interaction between PAC and age, suggesting that the direction of the association between PAC and VDQ varied across age. Overall, these results support previous findings of altered oscillatory connectivity in autism and suggest that differences become apparent early in development. Importantly, phase coherence may not only differentiate diagnostic groups but also capture meaningful variability within the autism group. Future research should further investigate the use of EEG coherence as a biomarker of language development in autism.

9
Medical discrimination and the selective erosion of institutional health trust: evidence from the Health Information National Trends Survey 6 and 7

Park, A.; Yin, L.; Wong, A.; Lee, C.; Choi, Y.

2026-06-09 public and global health 10.64898/2026.06.06.26355057 medRxiv
Top 0.6%
0.2%
Show abstract

Medical discrimination may alter how patients relate to health information sources following adverse care encounters. We examined whether discrimination experience is associated with selective erosion of institutional health trust and with compensatory digital health engagement, using nationally representative data from the Health Information National Trends Survey (HINTS) 6 (2022; n=6,252) and HINTS 7 (2024; n=7,278). Survey-weighted modified Poisson regression estimated prevalence ratios (PRs) for binary high-trust outcomes, and survey-weighted ordinary least squares estimated coefficients for continuous outcomes; jackknife replicate weights (50 replicates) provided variance estimates. Discrimination was associated with substantially lower probability of high trust in the healthcare system (PR=0.39; 95% CI 0.30-0.52) and physicians (PR=0.85; 95% CI 0.77-0.94), with no significant association for trust in scientists, government, family, or religious organisations. The clinical-institutional pattern replicated in HINTS 6, which additionally showed reduced trust in scientists for race/ethnicity-based discrimination. Contrary to a disengagement hypothesis, discrimination-exposed adults showed higher probability of online health information seeking (PR=1.06), health app use (PR=1.11), and online provider messaging (PR=1.13); these associations persisted after adjustment for trust in physicians. Discrimination was independently associated with lower health self-efficacy (b=-0.271). Medical discrimination selectively erodes trust in clinical institutions while leaving broader epistemic trust largely intact. Despite this, discrimination-exposed patients engage more actively with digital health channels, consistent with compensatory reorientation toward non-clinical information sources. These findings describe engaged but institutionally alienated patients, with implications for restoring clinical trust and for equity-centred digital health design.

10
A Comparison of Manual and Automated Approaches to Developing Computable Algorithms for Identifying Acute Pancreatitis

Bann, M. A.; Carrell, D. S.; Gruber, S.; Heagerty, P. J.; Williamson, B. D.; Nelson, J. C.; Hazlehurst, B.; Felcher, A.; Nyongesa, D. B.; Slaughter, M. T.; Sapp, D. S.; Cronkite, D. J.; Ball, R.; Floyd, J. S.

2026-06-08 health informatics 10.64898/2026.06.05.26354934 medRxiv
Top 0.7%
0.2%
Show abstract

Objective: Clinical phenotyping methods that rely on clinical and informatics expertise can be time-intensive and costly. We tested both manual and highly automated approaches using electronic health record (EHR) data to identify an FDA Sentinel Initiative health outcome of interest, acute pancreatitis. Materials and Methods: We trained and evaluated machine learning algorithms using EHR data with two approaches: a custom approach that included manually curated features and trained on outcomes data validated with medical record review, and a highly automated approach that greatly simplifies and automates feature engineering and relies on low-cost silver-standard outcomes for model training. Results: Custom algorithms using manually curated structured claims data discriminated cases from non-cases with a high degree of accuracy (cv-AUC 0.89 [95%CI 0.84-0.94]); the inclusion of natural language processing (NLP)-derived covariates from clinical notes increased performance slightly (cv-AUC 0.91[95%CI 0.86-0.97]). The automated algorithm trained on the outcome count of diagnosis codes performed less well (AUC 0.80 [95% CI 0.75-0.85]) but improved using maximum lipase value as an outcome (AUC 0.88 [95% CI 0.84-0.92]). At a positive predictive value of 90%, the custom algorithm had a sensitivity of 92%, the automated algorithm trained on diagnosis code count had a sensitivity of 45%, and the automated algorithm trained on maximum lipase value had a sensitivity of 84%. However, a prediction rule derived by clinicians during chart review was nearly as accurate (maximum lipase value [&ge;] 3 times upper limit of normal; AUC 0.86, PPV 85%, sensitivity 92%). Discussion: Machine learning algorithms with manually curated structured data and NLP features trained on validated outcomes data successfully identified validated events. Use of an outcome in the automated model based on specific phenotype knowledge (maximum lipase value) allowed for performance similar to the custom model and with considerably less resources.

11
Metatranscriptomics-Derived Disease Risk Scores as a Preventive, Diagnostic, and Treatment Support Tool

Hu, L.; Bass, M.; Patridge, E.; Molusky, M.; Antoine, G.; Vuyisich, M.; Banavar, G.

2026-06-06 genetic and genomic medicine 10.64898/2026.05.29.26354333 medRxiv
Top 1.0%
0.1%
Show abstract

Background: Chronic diseases and symptom syndromes often develop after prolonged biological changes that may precede formal diagnosis. RNA-based metatranscriptomics captures active microbial and human gene expression and may provide a functional layer for disease risk evaluation. To address this translational gap, we developed and validated a Disease Risk Score (DRS) framework that integrates metatranscriptome-derived pathway activity scores from stool, saliva, and blood samples, and evaluated its potential clinical utility as an adjunct risk-evaluation tool. Methods: DRS uses disease-specific sets of pathway activity scores derived from stool and saliva microbial functions, stool and saliva microbial taxa, and blood human gene expression. For each disease, 'not optimal' pathway scores are aggregated into a normalized cumulative odds ratio, or cOR, using score-level odds ratios, statistical significance, and literature-supported biological relevance derived from a Development Cohort of 22,369 individuals. A cOR [&ge;] 5 is defined as high risk. Performance is evaluated in an independent Validation Cohort of 15,908 individuals using self-reported diseases as the reference. Disease support requires both significant cOR separation between self-reported and not-reported (Cohen's d [&ge;] 0.2) and risk ratio enrichment of self-reported disease among individuals classified as high risk (95% CI of Risk Ratio > 1). Results: Of 20 initially evaluated diseases, 15 meet the prespecified validation criteria on the independent validation cohort: ADHD, anxiety, chronic fatigue syndrome, depression, GERD, hypertension, inflammatory bowel disease, IBS-C, IBS-D, insomnia, MASLD, obesity, obstructive sleep apnea, Sjogren's syndrome, and type 2 diabetes. Five selected clinical scenarios illustrate how DRS can support clinician-mediated decision making, including IBS subtype reclassification, improved diagnostic acceptance in IBS-D, personalized lifestyle counseling in MASLD and early type 2 diabetes, and diagnostic uncertainty in atypical GERD. Conclusions: DRS is a metatranscriptomics-based risk-stratification framework that aggregates active microbial and human pathway signals into interpretable disease-specific risk estimates across a wide range of disease conditions. Validation against self-reported disease labels in an independent cohort shows significant risk enrichment for each of 15 diseases. DRS is intended as an adjunct to clinical evaluation: a decision support tool in situations where routine care encounters uncertainty, delay, or low patient engagement. Future prospective studies using clinically adjudicated endpoints are needed to assess calibration and clinical outcomes.

12
High coverage, persistent gaps: quality of Antenatal Care and its determinants in Zambia based on the 2024 Demographic and Health Survey.

Tukamuhebwa, P. M.; Nuwabaine, L.

2026-06-12 public and global health 10.64898/2026.06.11.26355447 medRxiv
Top 1%
0.1%
Show abstract

Abstract Background Evaluating antenatal care (ANC) quality is critical to reducing maternal and neonatal mortality. In Zambia, despite high basic ANC attendance, comprehensive national evidence on the clinical content and quality of services remains limited. This study assessed the coverage of WHO-recommended ANC interventions and identified factors associated with care quality using the latest national data. Methods A cross-sectional analysis was conducted using data from the 2024 Zambia Demographic and Health Survey. The final analytic sample comprised 4,829 women aged 15-49 with a live birth in the preceding 5 years. A composite index of 15 selected, equally weighted WHO-recommended components evaluated clinical assessment, counseling/screening, preventive interventions, and utilization. Survey-weighted Poisson regression estimated adjusted incidence rate ratios (aIRRs) for the count of ANC components received. Results The mean ANC quality score was 12.5 out of 15 (95% CI: 12.4-12.6), and 78.5% (95% CI: 77.0-80.0) of women achieved adequate ANC ([&ge;] 12/15 components). While individual clinical and counseling coverage generally exceeded 90%, only 47.2% (95% CI: 45.3-49.0) of women initiated care during the first trimester, and just 4.8% (95% CI: 4.1-5.6) achieved [&ge;] 8 ANC contacts. Maternal education was the strongest and most stable predictor of quality across all models. Compared to no education, higher education was associated with an 8.0% higher expected quality score (aIRR = 1.080, 95% CI: 1.051-1.110). Lower ANC quality was significantly associated with unwanted pregnancies (aIRR = 0.970, 95% CI: 0.956-0.993) and with residence in Western (aIRR = 0.923, 95% CI: 0.897-0.951) and North Western (aIRR = 0.966, 95% CI: 0.937-0.996) provinces. Absence of distance barriers and residence in Eastern, Luapula, and Copperbelt provinces were associated with higher quality scores. Conclusion While average ANC component coverage in Zambia is high, critical gaps persist in early initiation and total contact frequency. Care adequacy is strongly influenced by maternal education, relationship status, pregnancy intention, and regional inequities. These findings underscore the need for interventions targeted at uneducated women, preventing unintended pregnancies, and underserved regions such as Western and North Western Provinces. Keywords: Antenatal care quality, ANC content, Zambia, maternal education.

13
Interoceptive accuracy and attention across multimorbidity classes: A latent class analysis

Mulder, J.; Boeker, C. M.; Smit, A. K.; Kiefte-de Jong, J. C.

2026-06-09 public and global health 10.64898/2026.06.08.26355147 medRxiv
Top 1%
0.1%
Show abstract

Background Multimorbidity is increasingly prevalent, and associated with worse clinical and psychosocial burdens. Interoception, the brain's ability to sense and interpret internal bodily signals, may contribute to multimorbidity, through its link with health behaviors, stress regulation, and mental health. This study examines whether self-reported interoceptive accuracy and attention is associated with multimorbidity, by identifying multimorbid subgroups and their interoceptive profiles. Methods Morbidity classes were identified through latent class analyses in two Dutch survey datasets, focusing on depression and alexithymia (DA-dataset; N = 671) and lifestyle factors (L-dataset; N = 1022). Linear regression analyses were used to assess interoceptive accuracy and attention (by the Interoceptive Accuracy Scale and Interoceptive Attention Scale respectively) among different subgroups. Results Multimorbid subgroups were characterized by older age, low socioeconomic position, and elevated physical, psychological, and behavioral problems. Multimorbid classes exhibited lower interoceptive accuracy (DA-dataset: B = -1.14, 95% CI = [-2.89, 0.62]; L-dataset: B = -2.36, 95% CI = [-3.83, -0.89]) and higher attention (DA-dataset: B = 3.62, 95% CI = [0.97, 6.27]; L-dataset: B = 1.07, 95% CI = [-1.42, 3.56]) compared to healthier classes. Conclusion Multimorbid populations demonstrated lower interoceptive accuracy and higher interoceptive attention. This highlights the psychosocial complexity of multimorbid populations which may impact their self-management and health behavior. These findings underscore the need to expand treatments to include psychosocial domains for multimorbid patients.

14
Parental educational attainment polygenic scores contribute to phenotypic heterogeneity in offspring with autism

Gao, S.; Sui, Y.; Tian, P.; Rao, X.; Yan, C.; Xu, Y.; Wang, T.

2026-06-08 genetic and genomic medicine 10.64898/2026.06.03.26354779 medRxiv
Top 1%
0.1%
Show abstract

Educational attainment-related polygenic scores have been implicated in autism spectrum disorder (ASD), but how parental polygenic scores shape offspring phenotypes remains unclear. Using genotyping and exome-sequencing data from 142,357 individuals (55,252 ASD cases) in a large ASD cohort, we dissected the direct and indirect genetic effects of educational attainment-related polygenic scores on ASD phenotypes. Trio-model analyses showed that parental polygenic scores for educational attainment (PGSEA ) were associated with milder core ASD symptoms, including social deficits and repetitive behaviors, predominantly through indirect genetic effects, whereas their associations with comorbidities were driven predominantly by direct genetic effects. PGSEA was also significantly negatively associated with rare variant burden and prenatal factors, although these factors contributed largely independently to most phenotypes. Adjustment for full-scale intelligence quotient (FSIQ) and socioeconomic status (SES) partially attenuated the indirect effects of PGSEA on offspring phenotypes. Finally, higher parental PGSEA was associated with later age at diagnosis in offspring, partly through its protective effects on ASD phenotypes. These findings indicate that indirect genetic effects of parentalPGSEA contribute substantially to phenotypic variation in ASD and highlight family-mediated pathways as an important component of ASD heterogeneity.

15
Maternal deaths associated factors in the Conflict-Affected North West Region of Cameroon. Lessons from a cross-sectional survey

ACHUONDOU, E. E.; Ayaba, U. W.; Kuma, A. C.; Talla, K. E.

2026-06-11 public and global health 10.64898/2026.06.10.26355370 medRxiv
Top 2%
0.0%
Show abstract

Background Maternal mortality is a significant global public health crisis, particularly in sub-Saharan Africa and conflict-affected regions. Cameroon's maternal mortality ratio is high at 406 deaths per 100,000 live births, while the ongoing Anglophone conflict has further exacerbated maternal healthcare delivery in the North West Region (NWR){middle dot} Despite the evidence-based interventions like partographs, obstetric kits, birth preparedness plans, and active management of the third stage of labour, implementation gaps persist across health facilities. Objective The study aimed to assess factors related to preventable maternal deaths in the NWR of Cameroon by exploring maternal health service usage, implementation of obstetric measures, demand-side challenges, accessibility barriers, and health system weaknesses. Methodology The study employed a quantitative descriptive cross-sectional survey design{middle dot} Data was collected with structured questionnaires from postpartum women and healthcare workers in selected health facilities and catchment communities in the NWR{middle dot} Also, a multistage sampling technique was adopted, and Cochran's formula generated a sample size of 109 respondents{middle dot} In addition, data were analysed using SPSS version 27 and Stata version 18, employing descriptive and inferential statistics. Results In this study, while 70{middle dot}64 percent of females attended at least 4 ANC visits, only 38{middle dot}53 percent met WHO ANC adequacy requirements. Facility delivery was 96{middle dot}33 percent, yet only 38{middle dot}46 percent received completed delivery plans. Conflict-related challenges affected access, with 44{middle dot}95 percent reporting insecurity-associated movement difficulties, while 44{middle dot}95 percent reported increased transportation expenses due to the conflict. Near-miss complications were reported among 27.52 percent of participants. Delivery record reviews indicated that obstetric kits were utilised in 81{middle dot}76 percent of deliveries, partographs were accessible in 86{middle dot}49 percent of records but correctly filled in just 60{middle dot}81 percent , while oxytocin administration was 95{middle dot}95 percent. Integrated Health Centres showed poorer adherence with intrapartum interventions compared with District and Regional Hospitals (p <0{middle dot}05). Conclusion In the NWR, maternal mortality was associated with accessibility, interconnected demand-side, conflict-related, and health-system determinants. While utilization of some maternal interventions was high, major implementation gaps, such as weak referral systems, insufficient BEmONC readiness, poor partograph compliance, and conflict disruptions, continually compromise neonatal and maternal outcomes. Strengthening lower-level facilities, enhancing emergency referral systems, and improving implementation of evidence-based obstetric interventions are crucial for minimising maternal mortality in the NWR.

16
Global practices in paediatric olfactory dysfunction: a cross-sectional survey of paediatric ENT surgeons

Spencer, G. M.; Karim, K.; Dzioba, A.; Graham, M. E.; You, P.; Hummel, T.; Gellrich, J.; Coyle, P.; Burns, H.; Peer, S.; Zawawi, F.; Lechien, J. R.; Schriever, V. A.; Bhargava, E. K.; Whitcroft, K. L.

2026-06-06 otolaryngology 10.64898/2026.06.04.26354942 medRxiv
Top 2%
0.0%
Show abstract

Background: Olfactory dysfunction (OD) in children remains underdiagnosed and poorly characterised. Despite its known impacts on nutrition, quality of life, safety awareness, and psychosocial development, no standardised diagnostic or management pathway currently exists for paediatric OD. This study aimed to characterise global practice patterns and identify diagnostic and therapeutic challenges unique to paediatric care. Methodology/Principal: A 44-item cross-sectional online survey was distributed to a verified international network of paediatric otolaryngologists across 36 countries via a closed professional platform. The survey assessed five domains: diagnostic practices, management protocols, technology and innovation, education and training, and barriers to effective care. Regional grouping was used to facilitate meaningful statistical comparisons. Categorical variables were evaluated using chi-square tests, with odds ratios and 95% confidence intervals reported for significant findings. Results: Of 351 potential participants, 167 responded (47.6% response rate). Most respondents (83%) reported seeing children with OD, yet 95% saw fewer than ten such patients annually. Psychophysical testing was never performed by 54.8% of respondents, while 88.4% routinely ordered cross-sectional imaging. Testing frequency increased significantly with patient age (Cochran's Q p<0.001). The most common barriers to objective testing were insufficient training (44.3%), time constraints (29.9%), and funding limitations (28.1%). Multidisciplinary collaboration was negligible. Significant regional variation was observed across most practice domains. Conclusions: Paediatric OD care is characterised by functional underinvestigation, fragmented multidisciplinary collaboration, and systemic educational gaps. These findings support urgent development of standardised clinical guidelines, age-appropriate validated assessment tools, and formal interdisciplinary care pathways.

17
Documented clinical genetic testing among carriers of hereditary breast and ovarian cancer variants: Ancestry and socioeconomic disparities in the All of Us research program

Yerukala Sathipati, S.; Scott, H.

2026-06-10 oncology 10.64898/2026.06.09.26355262 medRxiv
Top 2%
0.0%
Show abstract

Importance: Hereditary breast and ovarian cancer (HBOC) variant carriers benefit from risk-reducing interventions, but only if identified. The extent to which carriers are clinically recognized, and whether recognition is equitable across diverse populations, is poorly characterized in a single large U.S. cohort. Objective: To estimate P/LP HBOC carrier prevalence across genetic ancestry groups, quantify documented clinical genetic testing among carriers, and evaluate ancestry and socioeconomic disparities in testing. Design, Setting, and Participants: Cross-sectional analysis of the All of Us Research Program Controlled Tier (Curated Data Repository v8/C2024Q3R9), comprising participants with short-read whole genome sequencing and linked electronic health record (EHR) and survey data. Carriers were ascertained from research genomic data independent of clinical testing. Exposures: Genetically inferred ancestry (African [AFR], Admixed American [AMR], East Asian [EAS], European [EUR], Middle Eastern [MID], South Asian [SAS]); self-reported household income and educational attainment. Main Outcomes and Measures: (1) Carrier prevalence with Wilson 95% CIs; (2) documented clinical genetic testing (procedure codes) among carriers; (3) adjusted odds of documented testing among women, by ancestry, before and after socioeconomic adjustment, using multivariable logistic regression. Results: Among 414,830 participants, P/LP HBOC carrier prevalence was 1.42% (95% CI, 1.38-1.45) overall and similar across ancestry groups (AFR 1.24%, AMR 1.32%, EAS 1.19%, EUR 1.52%, MID 1.68%, SAS 1.33%; overlapping CIs). Among 250,071 women in the testing analysis, documented clinical genetic testing was rare: only 74 of 5,878 carriers overall (1.3%) and 59 of 3,572 European-ancestry carriers (1.7%) had a documented test, with counts below reportable thresholds in all other ancestry groups. African-ancestry women had lower adjusted odds of documented testing than European-ancestry women (Model 1 adjusted odds ratio [aOR], 0.32; 95% CI, 0.27-0.39), an association that attenuated but persisted after adjustment for income and education (Model 2 aOR, 0.48; 95% CI, 0.40-0.58; P < 0.001); Admixed American women also had reduced adjusted odds (aOR, 0.71; 95% CI, 0.61-0.84). Lower income and lower education were independently and dose-dependently associated with lower testing odds (income <$25,000 aOR, 0.46; high-school education aOR, 0.54). Conclusions and Relevance: High-risk HBOC variant carriers are present across all ancestry groups at similar frequencies, yet documented clinical genetic testing was disparate in the different ancestry groups. African-ancestry women experience a testing gap that is not fully explained by socioeconomic position, implicating structural barriers in access and referral. Population-level strategies that decouple carrier identification from current referral pathways may be required to close this gap.

18
PCRAgent: A Multi-Agent Framework for Transforming Noisy clinical conversations into Structured Pre-Consultation Medical Records and Reusable Clinical Data Resources

Zhang, M.; Zhao, J.; Tang, W.; Xing, J.; Li, J.; Zhang, H.; Qiu, J.; Zhang, Y.

2026-06-11 health informatics 10.64898/2026.06.10.26355372 medRxiv
Top 2%
0.0%
Show abstract

In primary care and outpatient settings, clinically important patient information is often embedded in fragmented, ambiguous, repetitive, and noisy communication between physicians and patients. This limits physicians ability to obtain a clear preconsultation overview of symptoms, history of present illness, and visit intent, while also preventing real world clinical dialogues from being reused in hospital information systems and medical artificial intelligence applications. To address this challenge, we developed PCRAgent, a centrally coordinated multi agent framework for preconsultation clinical information organization. Guided by physician inquiry logic, PCRAgent identifies, extracts, corrects, and standardizes patient-reported information from noisy consultations. Its coordinated modules including error detection, semantic editing, output control, contextual memory, and intent recognition enable robust parallel handling of spelling errors, repetitions, grammatical inconsistencies, medical ambiguities, and non-medical interference. A traceable edit list records intermediate corrections and context, allowing iterative refinement without redundant modifications. PCRAgent generates two complementary outputs. One is a PreConsultation Clinical Report for rapid physician review. The other is a Structured Clinical Conversation Dataset for hospital data construction and downstream AI applications. In evaluations using 220000 strongly perturbed consultations, PCRAgent maintained high robustness, achieving a clinical information accuracy of 4.99 out of 5 and key element completeness of 5 out of 5, outperforming GPT4o. Expert review of Chinese and English dialogues confirmed high clinical accuracy of 4.85 out of 5 and high safety of 4.79 out of 5. Multicenter validation in real-world outpatient workflows further demonstrated practical utility. These findings indicate that PCRAgent can efficiently transform noisy and unstructured consultations into physician ready reports and AI ready structured data, improving outpatient efficiency, reducing cognitive burden, ensuring information completeness, supporting precise decision-making, and enabling high-quality reuse of clinical data.

19
Low-Dose Aspirin Adherence Following Objective cell-free RNA-Based Preeclampsia Risk Testing: A Real-World Survey Study

Moe, A. B.; Haverty, C.; Lee, M.; Hahn, S. E.; McElrath, T. F.; Jain, M.; Rasmussen, M.; Corso, A.; Larson, M. L.; Morrison, H.; Melroy, L. M.; Roofeh, J.; Phelps-Sandall, B.; Kiefer, D.; Biggio, J. R.

2026-06-10 obstetrics and gynecology 10.64898/2026.06.08.26355195 medRxiv
Top 2%
0.0%
Show abstract

Introduction: Preeclampsia (PE) is a leading cause of maternal and neonatal morbidity and mortality, and low-dose aspirin (LDA) prophylaxis is the cornerstone of evidence-based prevention. Despite guideline recommendations, LDA adherence remains poor, with 10-25% of moderate-risk patients taking aspirin. Objective personalized risk stratification using biomarkers has been shown to motivate behavior change in other disease contexts. Survey data suggest that patients are more motivated to take aspirin if informed by an objective predictive test. Here, we report real-world LDA adherence among patients who received a high-risk result from a cell-free RNA (cfRNA) PE risk prediction test. Methods: This retrospective, observational survey study included asymptomatic patients of advanced maternal age (AMA; [&ge;] 35 years at delivery) with singleton pregnancies without USPSTF-defined preexisting high-risk conditions for PE who received the cfRNA PE risk prediction test. Patients who opted in to receive text message surveys were asked about LDA use following receipt of test results. High adherence was defined as reporting LDA use on at least 6 of 7 days per week at least 85% of the time surveyed. The primary analysis included patients with a high-risk test result and at least one LDA frequency survey response following receipt of test result. The observed proportion of adherent patients was compared to a baseline estimate of 25% using an exact binomial test. Results: Of 166 patients who received a cfRNA PE risk prediction test result, 48 (28.9%) received a high-risk result. Of these, 29 (60%) opted in and responded to at least one survey, constituting the primary analysis population. Twenty-seven of the 29 (93.1%; 95% CI: 78.0-98.1%) were classified as highly adherent, significantly higher than the 25% baseline adherence estimate for moderate-risk patients (p < 0.0001). Conclusion: Among surveyed patients who received a high-risk cfRNA PE test result, the proportion classified as highly adherent to LDA (93%) substantially exceeded published estimates of adherence in a similar patient population and met the clinically meaningful threshold of [&ge;] 80% associated with reduced risk of preterm preeclampsia. These findings indicate that objective and personalized biomarker risk testing may be a powerful driver of behavior change that current guidelines have failed to produce.

20
Topological Deep Learning Identifies Polygenic Variant Clusters Across Familial Multimorbid Disorders

Vomo-Donfack, K. L.; Bousquet, G.; Falgarone, G.; Ginot, G.; Morilla, I.

2026-06-09 health informatics 10.64898/2026.06.03.26354242 medRxiv
Top 2%
0.0%
Show abstract

Whole-genome sequencing comprehensively captures coding, non-coding and structural variation in families with suspected inherited disorders, yet its clinical utility remains constrained by an interpretation bottleneck: selecting a handful of relevant variants from millions of candidates. Current rule-based pipelines, anchored in ACMG/AMP criteria, excel at identifying highly penetrant Mendelian alleles but frequently miss variants of low-to-moderate penetrance, non-coding alterations and germline-somatic interactions. Here we introduce PolyCLIP-T, a topology-guided multimodal framework that transforms variant selection from a classification problem into a geometric discovery task. By contrastively aligning DNA-sequence embeddings with functional annotations, PolyCLIP-T constructs a unified latent space in which the displacement between reference and alternate embeddings quantifies the molecular perturbation induced by each variant. Persistent homology then identifies stable topological components - coherent variant groups shared among affected relatives - that transcend single-variant scoring logic. Applied to six families with multi-morbid cancer, autoimmune and cardiovascular disease, PolyCLIP-T recovered non-coding and structural candidates overlooked by conventional pipelines and revealed pleiotropic networks spanning disease categories. This approach provides an interpretable, scalable solution for genome-first investigations of disorders driven by polygenic architectures that evade single-variant analysis. The framework was developed and benchmarked on deeply characterised familial cohorts selected for transgenerational multimorbidity; validation in larger, independent populations will be essential to establish its generalisability. An interactive web tool is freely available at https://www.polyclip-t.uma.es/.